FP-Hadoop

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data

Big data parallel frameworks, such as MapReduce or Spark have been praised for their high scalability and performance, but show poor performance in the case of data skew. There are important cases where a high percentage of processing in the reduce side ends up being done by only one node. In this demonstration, we illustrate the use of FP-Hadoop, a system that efficiently deals with data skew ...

متن کامل

FP-Hadoop: Efficient processing of skewed MapReduce jobs

Nowadyas, we are witnessing the fast production of very large amount of data, particularly by the users of online systems on the Web. However, processing this big data is very challenging since both space and computational requirements are hard to satisfy. One solution for dealing with such requirements is to take advantage of parallel frameworks, such as MapReduce or Spark, that allow to make ...

متن کامل

Sentiment Analysis on Hadoop with Hadoop Streaming

Ideas and opinions of peoples are influenced by the opinions of other peoples. Lot of research is going on analysis of reviews given by peoples. Sentiment analysis is the major computational technique to calculate or observe sentiments of people's thoughts. Therefore, a method that assigns scores indicating positive and negative opinion about the product is proposed. It uses Hadoop Distrib...

متن کامل

363.258 fp

~Tn'i's project used three methods of DNA identification: (1) DNA typing using single locus probes, (2) DNA fingerprinting with multi-locus probes, and (3) polymerase chain reaction (PCR) for sex determination. The hypothesis to test was whether DNA profiles remain constant over time, including the lifetime of an individual. This has relevance to the potential establishment of computer database...

متن کامل

nonordfp: An FP-growth variation without rebuilding the FP-tree

We describe a frequent itemset mining algorithm and implementation based on the well-known algorithm FPgrowth. The theoretical difference is the main data structure (tree), which is more compact and which we do not need to rebuild for each conditional step. We thoroughly deal with implementation issues, data structures, memory layout, I/O and library functions we use to achieve comparable perfo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the VLDB Endowment

سال: 2015

ISSN: 2150-8097

DOI: 10.14778/2824032.2824085